Rank | Count | Beginning |
---|---|---|
32472 | 6030 | ಈ |
24685 | 2458 | ಇದು |
15590 | 1726 | ಆದರೆ |
10885 | 973 | ಅವರು |
22459 | 960 | ಇದರ |
44257 | 817 | ಒಂದು |
13765 | 702 | ಆ |
21752 | 689 | ಇದನ್ನು |
9989 | 673 | ಅವರ |
48962 | 523 | ಕೆಲವು |
29770 | 508 | ಇವರು |
28234 | 467 | ಇಲ್ಲಿ |
65078 | 456 | ನಂತರ |
98113 | 402 | ಹೀಗೆ |
5820 | 382 | ಅದು |
21357 | 380 | ಇದಕ್ಕೆ |
23269 | 374 | ಇದರಲ್ಲಿ |
14169 | 366 | ಆಗ |
29257 | 335 | ಇವರ |
5087 | 316 | ಅದರ |
6274 | 310 | ಅದೇ |
18106 | 308 | ಆದ್ದರಿಂದ |
58886 | 308 | ತಮ್ಮ |
27233 | 294 | ಇದೇ |
15594 | 287 | ಆದರೆ, |
23856 | 283 | ಇದರಿಂದ |
58482 | 278 | ತನ್ನ |
30350 | 273 | ಇವು |
75696 | 270 | ಮತ್ತು |
6968 | 265 | ಅನೇಕ |
The next four subsections show the most frequent sentence beginnings consisting of N words, for N = 1, 2, 3, 4. This subsection starts with N = 1.
The most frequent word-N-grams at the beginning of sentences give some insight into sentence composition.
For N = 1 in particular, a small corpus is enough to identify the most frequent sentence beginnings.
SELECT substring_index(sentence, ' ', 1) AS beg, count(*) AS cnt FROM sentences GROUP BY substring_index(sentence, ' ', 1) ORDER BY cnt DESC LIMIT 50;
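For readers without access to the database, the same count can be approximated outside SQL. The following is a minimal Python sketch, not the code used to produce the table: the function name `count_beginnings`, its parameters, and the toy sentences are assumptions made for illustration. It takes the text before the N-th space of each sentence, as `substring_index(sentence, ' ', N)` does, and returns the most frequent beginnings.

```python
from collections import Counter


def count_beginnings(sentences, n=1, limit=50):
    """Count the first n space-separated words of each sentence.

    Hypothetical helper: mirrors the query's grouping, not its storage.
    """
    counts = Counter()
    for sentence in sentences:
        # Like substring_index(sentence, ' ', n): keep everything before
        # the n-th space, or the whole sentence if it has fewer spaces.
        beginning = " ".join(sentence.split(" ")[:n])
        counts[beginning] += 1
    # Most frequent beginnings first, cut off after `limit` entries.
    return counts.most_common(limit)


# Toy input for illustration only; not taken from the corpus.
sample = ["This is a test", "This was fine", "It works"]

top_one_word = count_beginnings(sample, n=1)
top_two_words = count_beginnings(sample, n=2)
```

Because the split is on spaces only, punctuation stays attached to the word, which is why the table lists ಆದರೆ and ಆದರೆ, as separate beginnings. Passing `n=2`, `n=3`, or `n=4` gives the longer beginnings covered in the following subsections.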
4.3.1.2 Most Frequent Sentence Beginnings II
4.3.1.3 Most Frequent Sentence Beginnings III
4.3.1.4 Most Frequent Sentence Beginnings IV
4.3.1.1 Most Frequent Sentence Endings I
4.3.1.2 Most Frequent Sentence Endings II
4.3.1.3 Most Frequent Sentence Endings III
4.3.1.4 Most Frequent Sentence Endings IV